Training Set Issues in SRI's DECIPHER Speech Recognition System

نویسندگان

  • Hy Murveit
  • Mitch Weintraub
  • Mike Cohen
چکیده

SRI has developed the DECIPHER system, a hidden Markov model (HMM) based continuous speech recognition system typically used in a speaker-independent manner. Initially we review the DECIPHER system, then we show that DECIPHER's speakerindependent performance improved by 20% when the standard 3990-sentence speaker-independent test set was augmented with training data from the 7200-sentence resource management speaker-dependent training sentences. We show a further improvement of over 20% when a version of corrective training was implemented. Finally we show improvement using parallel maleand femaletrained models in DECIPHER. The word-error rate when all three improvements were combined was 3.7% on DARPA's February 1989 speaker-independent test set using the standard perplexity 60 wordpair grammar.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic evaluation and training in English pronunciation

SRI is developing a system that uses real time speech recognition to diag nose, evaluate and provide training in spoken English. The paper first describes the methods and results of a study of the feasibility of automati cally grading the performance of Japanese students when reading English aloud. Utterances recorded from Japanese speakers were independently rated by expert listeners. Speech g...

متن کامل

Adaptation of hidden Markov models using multiple stochastic transformations

The recognition accuracy in recent large vocabulary Automatic Speech Recognition (ASR) systems is highly related to the existing mismatch between the training and test sets. For example, dialect di erences across the training and testing speakers result to a signi cant degradation in recognition performance. Some popular adaptation approaches improve the recognition performance of speech recogn...

متن کامل

Speech Recognition in SRI's Resource Management and ATIS Systems

This paper describes improvements to DECIPHER, the speech recognition component in SKI's Air Travel Information Systems (ATIS) and Resource Management systems. DECIPHER is a speaker-independent continuous speech recognition system based on hidden Markov model (HMM) technology. We show significant performance improvements in DECIPHER due to (I) the addition of tied-mixture I-IMM modeling (2) rej...

متن کامل

Performance of SRI's Decipher TM Speech Recognition System on DARPA's CSR Task

The system was evaluated on the speaker-independent (SI) portion of DARPA's February 1992 "Dry-Run" WSJ0 test and achieved 17.1% word error without verbalized punctuation (NVP) and 16.6% error with verbalized punctuation (VP). In addition, we increased the amount of training data and reduced the VP error rate to 12.9%. This SI error rate (with a larger amount of training data) equalled the best...

متن کامل

SRI's Real-Time Spoken Language System

PROJECT GOALS This project involves the integration of speech and natural-language processing for spoken language systems (SLS). The goal of this project, to develop a multi-modal interface to the Official Airline Guide database, is being developed along two overlapping research and development lines: one focussed on an SLS kernel for database query, and the other on the full interactive system...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1990